www.ramoncasares.com 20150812 



Syntax 1 



Syntax Evolution: Problems and Recursion 

Ramon Casares 

We are Turing complete, and natural language parsing is decidable, 
so our syntactic abilities are in excess to those needed to speak a 
natural language. This is an anomaly, because evolution would not 
keep an overqualified feature for long. We solve this anomaly by 
using a coincidence, both syntax and problem solving are comput- 
ing, and a difference, Turing completeness is not a requirement of 
syntax, but of problem solving. Then computing should have been 
shaped by evolutionary requirements coming from both syntax and 
problem solving, but the last one, Turing completeness, only from 
problem solving. So we propose and analyze a hypothesis: syntax 
and problem solving co-evolved in humans towards Turing com- 
pleteness. Finally, we argue that Turing completeness, also known 
as recursion, is our most singular feature. 

Keywords: language evolution, syntax, problem solving, Turing com- 
pleteness, recursion. 

§1 Introduction 

§1.1 A Mystery 

%l ■ My investigation started with an anomaly: our syntactic abilities are in excess to 
those needed to speak a natural language. I knew since long ago that we are Turing 
complete, and that natural language parsing is decidable, but while I was aware of the 
anomaly, it was a mystery to me why would evolution keep an overqualified feature for 
such a long time. 

%2 ■ This paper is the best way that I have found to explain how to solve the mystery. 
My solution is based on a coincidence, both syntax and problem solving are computing, 
and on a difference, Turing completeness, also known as recursion, is not a requirement 
of syntax, but of problem solving. Then computing would have been receiving evolution- 
ary requirements from both, from syntax and from problem solving, and the last one, 
recursion, would not be a requirement of syntax, but of problem solving. 
%3 ■ Perhaps my solution is not the best one, and it is surely incomplete. I don't know 
the details of the hypothetical co-evolution of syntax and problem solving that solves the 
mystery, so I invite you, dear reader, to enhance it. 

%4 ■ I hope the proposal in this paper makes sense to you, but even if it were completely 
wrong, the paper would still have some value, because it states a coincidence that asks 
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for an explanation, and an anomaly that needs to be solved. I will try to explain the 
coincidence elsewhere, but I have tried to solve the anomaly here. All anomalies signal 
a theoretical failure, and I think this one is especially important because it seems to me 
that this one is precisely the anomaly that is behind Chomsky's refusal to study syntax 
evolution. And Chomsky is right even when he is wrong! 

§1.2 Contents 

11 ■ The main course of the paper starts with the coincidence. In Section §2 we examine a 
seminal paper by Chomsky (1959) concluding that syntax is computing. In a similar way, 
in Section §3 we examine a founding paper by Turing (1936) concluding that problem 
solving is computing. Before using this coincidence, we note that the best for problem 
solving is the maximum computing power, and the maximum computing power is that 
of Turing completeness, also known as recursion. 

12 • So in Section §4 we investigate recursion. We see, in Subsection §4.1, that there are 
several versions of recursion that are mathematically equivalent, and each one provides 
its own terminology. So, in Subsection §4.2, we look for the properties that are common 
to all versions of recursion, taking note of one: undecidability. Then, in Subsection §4.3 
we learn that any Turing complete device needs to understand a recursive language to be 
able to calculate any recursive function. But we discover in Subsection §4.4 that recursive 
languages are peculiar, because its syntax parser can be, and should be, decidable, and 
therefore it cannot be, and should not be, recursive. At this point, we are ready to solve 
the anomaly by using the coincidence and the difference: recursion is not a requirement 
of syntax, but of problem solving. Noting that recursion is a syntactic property, and 
that recursion is not a syntactic requirement, the conclusion is that recursion is not a 
requirement of language, and then we should reject the 'FLN is recursion' hypothesis 
by Hauser, Chomsky, and Fitch (2002), ending this way Subsection §4.5. Instead, in 
Subsection §4.6, we propose our own hypothesis: syntax and problem solving co-evolved 
in humans towards recursion. 

13 • Our hypothesis implies that some features were incorporated to our natural language 
syntax because they enhance our problem solving abilities, and then our survival fitness, 
and not, or not only, because of its syntactic convenience. So in Section §5 we analyze 
some of these features. An interesting finding is that syntax, that is, some sentence 
structure, is needed to express problems. Others are better known, for example that the 
simplest open data structure is the binary tree, or that the only instruction needed by a 
universal computer is the conditional, or that problem solving is recursive. 

14 • In the last section, §6, we explore some implications of the hypothesis. Firstly, in 
Subsection §6.1, we investigate the impact of Turing completeness on evolution, finding 
that creativity explodes because, after achieving Turing completeness, performing new 
ways of solving the survival problems becomes cheap and available to single individuals, 
while, before achieving it, any single new way of solving them required genetic changes 
on species over evolutionary time spams. Secondly, in Subsection §6.2, we present the 
co-evolution of the computing machinery that is common to syntax and problem solving 
as a process that tries to accommodate requirements from both, from syntax (communi- 
cation) and from problem solving (thinking), and we observe that the same presentation 
is valid for the co-development of syntax and problem solving in children. Then, in Sub- 
section §6.3, we try to order the steps of the co-evolution, but only the first and the 
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last ones are somewhat plausible, especially the last step, because we know that Turing 
completeness is not a requirement of syntax, but of problem solving. In any case, syntax 
was instrumental in achieving Turing completeness, and then in causing the explosion 
of creativity of our own species, and because of this, in Subsection §6.4, we agree with 
Chomsky that syntax is a hiatus that separates our own species from the rest. And lastly, 
in Subsection §6.5, we conclude. 

§2 Syntax 
§2.1 Grammar 

%l ■ Chomsky (1959) presents a hierarchy of grammars. A grammar of a language is a 
device that is capable of enumerating all the language sentences. And, in this context, 
language is the (usually infinite) set of all the valid syntactic sentences. 
%2 ■ At the end of section 2 in that paper, we read: "A type 0 grammar (language) 
is one that is unrestricted. Type 0 grammars are essentially Turing machines". At the 
beginning of section 3, we find two theorems. 

Theorem 1. For both grammars and languages, type 0 D type 1 D type 2 D type 3. 
Theorem 2. Every recursively enumerable set of strings is a type 0 language 
(and conversely). 

Then theorem 2 is explained: "That is, a grammar of type 0 is a device with the 

generative power of a Turing machine." 

1f3 • From the two theorems we can deduce three corollaries. 

Corollary 1. The set of all type 0 grammars (languages) is equal to the set of all 
grammars (languages). 

This is because, according to theorem 1, type 0 is the superset of all grammars 
(languages). 

Corollary 2. For each Turing machine there is a type 0 grammar (and conversely). 
This is equivalent to theorem 2, but in terms of grammars (devices) instead of 
languages (sets). 

Corollary 3. For each Turing machine there is a grammar (and conversely). 
This results by applying corollary 1 to corollary 2. 

§2.2 Syntax Definition 

%l ■ The third corollary shows that 'Turing machine' and 'grammar' are equivalent devices, 
by which 'computing' and 'syntax' are equivalent concepts. Computing is equivalent to 
syntax, rather than to language, because language in Chomsky (1959) refers only to 
syntax. It does not refer to semantics, because meanings are not considered, nor to 
pragmatics, because intentions are not considered. 

Syntax = Computing 

12 • So now we will provide the definition of syntax that follows from Chomsky (1959) 
after adjusting for this deviation towards language: syntax consists of transformations 
of strings of symbols, irrespective of the symbols meanings, but according to a finite set 
of well defined rules; so well defined as a Turing Machine is. This definition of syntax is 
very general and includes natural languages syntax, just replacing symbols with words and 
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strings with sentences, but it also includes natural languages morphology or phonology, 
taking then sub-units of words or sounds. 

13 ■ As an aside, please note that this definition of syntax manifests, perhaps better 
than other definitions, what we will call the syntax purpose paradox: syntax, being 
literally meaningless, should be purposeless. It is easy to underestimate the value of 
some mechanical transformations of strings of symbols. 

§3 Problem Solving 

§3.1 Computing and Problems 

11 • The connection between computing and problem solving goes back to the very foun- 
dation of computing. Turing (1936) defines his machine in order to prove that the Ent- 
scheidungsproblem, which is the German word for 'decision problem', is unsolvable. After 
defining the Turing machine, he shows that there is not any Turing machine that can 
solve the problem. Note that this proof is valid only under the assumption that Turing 
machines can solve any solvable problem. 

Problem Solving = Computing 

12 ■ It is even more explicit Post (1936), who writes that he has in mind a general problem, 
and then he goes on to define a 'problem solver' which is very similar and equivalent to 
a Turing machine. 

13 ■ From then, the equation between computing and problem solving holds, even if it is 
not expressed. For example, for Simon and Newell (1971) a theory of human problem 
solving gets credit only if a computer program simulates precisely the behavior of a person 
solving the same problems. 

§3.2 Turing Completeness 

11 ■ After assimilating that problem solving is computing, the next step is to assume 
that solving more problems provides more survival possibilities, and therefore that the 
evolution of computing was driven by the evolution of problem solving, see Casares (2015). 
In addition, if problem solving is computing, then the maximum problem solving power 
is the maximum computing power, and therefore the last step in this evolutionary thread 
is to achieve the maximum computing power, which is to achieve Turing completeness. 

12 • By definition, a device is Turing complete if it can compute anything that any Turing 
machine can compute. The Turing machine, as it was presented by Turing (1936) himself, 
models the calculations done by a person, meaning that we can compute whatever a 
Turing machine can compute, and therefore we are Turing complete. 

13 • Turing completeness is the maximum syntactic power. This follows directly from the 
equation between syntax and computing seen in Subsection §2.2. 

14 • The prototype of Turing complete device is a universal Turing machine, also defined 
by Turing (1936), but there are others, each one associated with one version of recursion. 
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§4 Recursion 

§4.1 Versions of Recursion 

%l ■ There are three classical versions of recursion: 

o Recursive functions, due to Godel, Herbrand, and Kleene, where a recursive function 
can be defined without reference restrictions, and, in particular, the definition of a 
function can refer to itself. Its classical reference is Kleene (1936a). 

o Lambda functions, due to Church, Kleene, Rosser, and Curry, which are higher-order 
functions, that is, functions on functions to functions. In lambda-calculus everything 
is a function, or a variable. Its classical reference is Church (1935). 

o Computable functions, due to Turing, and Post, that are ruled transformations of 
strings of symbols. There is a universal computer for which any computer is a string 
of symbols. Its classical reference is Turing (1936). 
There are now more versions of recursion, but here we will manage with these three. 
%2 ■ Although apparently very different to each other, the three are equivalent: 

o Kleene (1936b) showed that every lambda function is recursive, and the converse. 
This means that for each lambda function there is a recursive function that performs 
the same calculation, and the converse, and therefore lambda functions are equivalent 
to recursive functions. Or even better, it means that whatever a lambda-calculus 
interpreter can calculate, it can also be calculated by an ideal (error free and eternal) 
mathematician using recursive functions, and the converse. 

o Turing (1937) showed that every lambda function is computable, and that every 
computable function is recursive. This means that for each lambda function there 
is a Turing machine that performs the same calculation, and that for each Turing 
machine there is a recursive function that performs the same calculation, and there- 
fore, together with Kleene (1936b), computable, lambda, and recursive functions are 
equivalent. 

Computable Function = Lambda Function = Recursive Function 

Or better, speaking of devices, it means that whatever a universal Turing machine 
can calculate, it can also be calculated by a lambda-calculus interpreter, or by a 
mathematician using recursive functions, and the converse of both. 
%3 ■ These mathematical equivalences show that we can implement recursion in several 
ways. Each implementation has its own advantages and disadvantages, and then they 
are not completely equivalent from an engineering point of view. 

14 • We say that a universal Turing machine, a lambda-calculus interpreter, a mathemati- 
cian calculating recursive functions, or any other mathematically equivalent calculating 
device, is a universal computer. Only universal computers implement recursion. Then 
'universal computer' is synonymous with 'Turing complete device', and also with 'recur- 
sive device'. 



Universal Computer = Turing Complete Device = Recursive Device 
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§4.2 Properties of Recursion 

11 • Recursive devices have some properties in common: incompleteness (Godel 1930), 
undecidability (Church 1935), and extendability (Turing 1938); all three summarized by 
Post (1944). 

12 ■ Note that these properties are properties of devices, and not properties of functions. 
A recursive function can be decidable, but all recursive devices are undecidable. This 
can be a source of confusion, because only the most capable devices are recursive, while 
even the simplest functions are recursive. An example of simple recursive function is the 
literal identity function that just returns what it takes, and then it is a 'no operation' 
instruction that, in fact, does not compute anything. To prevent this confusion, except 
when we refer explicitly to recursive functions, we will always use recursion, or recursive, 
to refer to the property of devices. 

13 • So recursive devices are undecidable, and then if you can prove that a device, whatever 
the arguments it takes, always completes the calculations in a finite time, then you have 
proved that the device does not implement recursion, or, in other words, that it is not 
Turing complete. The converse is not true; that a computation does not complete does 
not imply that the computer is Turing complete. 

§4.3 Recursive Language 

11 • A recursive device can compute any recursive function, but, to do it, it needs the 
expression of the function. Then, each universal computer understands a way to express 
any possible recursive function. We will call this language that every Turing complete 
device needs to express any recursive function a recursive language. 

12 • For example, a typical lambda-calculus interpreter understands a recursive language 
A defined inductively this way: 

x e V c A 

x G V x G V 
x G V, M G A => (Ax M) G A 
M, N G A (MN) G A 

This means that in lambda-calculus there are only: an infinity of variables built by 
priming, which belong to the set V; lambda-abstractions, that is, function definitions, 
(Xx M); and function applications, (MN). These last two are ordered pairs that grow 
into binary trees by induction. 

13 • Other universal computers use other syntaxes, so implementing binary trees is not 
a requirement for recursion. For example, a universal Turing machine understands the 
transition table of the Turing machine to emulate. That table is a list of conditionals: 
if the current internal state is S{, and the symbol read is r (r can also be blank), then 
write symbol w (w can be equal to r or not, even blank), go to internal state s 0 (s 0 can 
be equal to si), and move to the right, or to the left, or halt. 

14 • While these Turing complete devices implement complete syntax engines, they need 
also semantics. Syntax alone is not enough, and some semantics are needed. It is not 
enough that the expressions represent the functions, the Turing complete device has 
to understand what the expressions mean to produce the results that the represented 
functions produce. Functional semantics are needed to implement a complete syntax! 
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15 • In this paper, we will consider functional semantics part of syntax, rather than part 
of semantics, because functional semantics only provide the meanings of functions, and 
functions are the core of syntax. To prevent mistakes, we will call a language composed 
only by syntax and functional semantics a syntactic language. Then, every recursive 
language includes a syntactic language. 

§4.4 Parsing 

11 ■ It is important to note that, while recursive languages have to be limitless, in order 
to accommodate any possible recursive function, it is possible, and beneficial, that the 
parsing of its syntax be always completed in a finite time, as it is the case of the two 
examples presented in the previous subsection, §4.3. More precisely, while any recursive 
language is undecidable, its syntax can be, and should be, decidable. 

12 ■ This explains an otherwise paradoxical fact: our syntactic abilities are in excess to 
those needed to speak a natural language. We are Turing complete, which is the maximum 
syntactic capacity, as required for better problem solving, though the natural language 
syntax parsers are not Turing complete, to assure parsing completion. In other words, 
recursion is not a requirement of natural language syntax, and being recursion a syntactic 
property, we conclude that recursion is not a requirement of language. 

13 • We will repeat it just because this is the key of the paper; once this is assimilated, 
everything else is pretty straightforward. Both syntax and problem solving are computing, 
but while recursion, which is the property that signals the maximum computing power, 
is not desirable for a syntax, not even for the syntax of a recursive language, the very 
same recursion is the most desirable property for problem solving. 

§4.5 Human Uniqueness 

11 ■ Because parsing binary tree data structures does not require Turing completeness, 
then a species that is not Turing complete can use binary tree data structures. On the 
other hand, it seems that we are the only Turing complete species, that is, the only one 
implementing recursion. 

12 • To prove that a device is Turing complete you have to show that it can compute any 
recursive function. Therefore, to be qualified as Turing complete, a device has to take 
the expression of any recursive function in some language, and any data, and then return 
the right result. This implies that a recursive species needs a recursive language. 

13 • An indirect evidence that we are the only Turing complete species is that we are 
invading nearly every ecological niche, being a plague at a planetary scale, because this 
shows that our problem solving abilities are too good, and qualitatively better than the 
abilities of any other species. 

14 • If we are in fact the only Turing complete species, then recursion is uniquely human. 
It is also true that any human organ is also uniquely human, that is, the human eye is not 
like any other, and an anthropologist can distinguish a human bone from any non-human 
bone. But perhaps recursion is different, because it is a property that our species homo 
sapiens has, and every other species has not, and because recursion is a singular property, 
as explained below in Subsection §6.1. 

15 • If recursion is a requirement of problem solving, and by this we mean that recursion 
has some survival value for problem solving, which is very likely true because of the 
recursive nature of problems (see Section §3), then the hypothesis that 'what is uniquely 
human and unique to the faculty of language — the faculty of language in the narrow 
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sense (FLN) — is recursion', by Hauser, Chomsky, and Fitch (2002), is only partly true. 
While our species seems to be the only Turing complete one, the recursive machinery 
would not be used only by language. 

§4.6 Our Hypothesis 

%l ■ Even if the 'FLN is recursion' hypothesis is not completely true, it touches one of 
the most important questions about language and syntax: their relation to recursion. To 
fix that hypothesis, we dare to state a new one that takes into account problem solving. 
Our hypothesis is that 'syntax and problem solving co-evolved in humans towards Turing 
completeness'. 

%2 ■ The argument goes like this: Solving more problems provides more survival opportu- 
nities, and then, because of the recursive nature of problems, this thread of evolution goes 
naturally towards Turing completeness (Section §3). And Turing completeness requires a 
syntactic language in which to express and mean any recursive function (Subsection §4.3). 
Then our syntax should have evolved to satisfy the requirements of problem solving. 
13 • To be clearer, we agree with Chomsky and the 'FLN is recursion' hypothesis in the 
following points: 

o we are the unique recursive species, 

o we are the unique species with a recursive language, 

o recursion is a syntactic property, and 

o recursion is not a requirement of syntax, nor a requirement of language. 
But then Chomsky does not want to go further. In fact, he cannot go further because, if 
recursion is not a requirement of language, meaning that recursion has not survival value 
for language, and recursion is exclusively a language property, as stated by the 'FLN is 
recursion' hypothesis, then recursion cannot have an evolutionary explanation. It is very 
illustrative to read Chomsky (2000), pages 161-163, where he writes about the evolution 
of language, and problem solving! 
%4 ■ Additionally, for us, 

o recursion is a requirement of problem solving, 
and then recursion can have an evolutionary explanation. And then we can state our 
hypothesis: 'syntax and problem solving co-evolved in humans towards recursion'. 



§5 Evolution 
§5.1 Requirements 

1fi • It is true that we can express any recursive function in our natural language, and 
that by reading the expression we can understand how the function works, to the point of 
producing the right results. This is just a different way to state that our natural language 
is recursive. On the other hand, problem solving achieves its maximum power when 
Turing completeness is achieved, and Turing completeness is mathematically equivalent to 
recursion. For our hypothesis, this happens because problem solving and syntax are both 
computing, and therefore computing was shaped by taking evolutionary requirements 
from both, and then while Turing completeness is not a requirement of syntax, our 
natural language is nevertheless recursive because Turing completeness is a requirement 
of problem solving. In other words, some modifications of language were selected, totally 
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or partly, because they enhanced our problem solving abilities increasing our survival 
fitness. 

12 • To better assess our hypothesis, in this Section §5 we will analyze those requirements. 
Our guide to problem solving in this investigation will be Casares (2015). 

§5.2 Interrogative Pronoun 

11 ■ To express a problem we have to refer to its freedom and to its condition. To name 
the freedom we have to use a word that does not refer to anything, that is, it has to be 
a word free of meaning. For example, if the problem is that we do not know what to do, 
then its more direct expression in English is 'what to do?'. In this sentence, the word 
'what' does not refer to anything specifically, but it is a word free of meaning and purely 
syntactical. 

12 • In mathematics, the typical way to express the freedom of a problem is to use the 
unknown x, which works the same way as the interrogative pronoun 'what'. For example, 
if we want to know which number is the same when it is doubled as when it is squared, 
we would write: 

x? [2x = x 2 }. 

The condition is the equality [2x = x 2 ], in this case. Equality is a valid condition because 
it can be satisfied, or not. 

§5.3 Sentence 

11 • The x is just a way to refer to something unknown, so we could use any other expedient 
just by indicating it with the question mark (?). This means that 

V* [2y = y 2 ] 

is exactly the same problem. We call this equivalence a-conversion. 

12 • It is important to note that the unknown has to be part of the condition, in order to 
determine if a value is a solution to the problem, or not. In the condition, the unknown 
x is a free variable, and therefore the condition is an open expression, that is, a function. 

13 • In summary, in order to refer to the freedom of a problem we have to use free variables, 
which are words free of meaning that do not refer to anything. These free words are useless 
by themselves, we can even substitute one for another using an a-conversion, so they have 
to be combined with other words to compose the condition of the problem. We will call 
this structure of words a sentence. All things related to the sentence, as, for example, 
the rules for word combination, are what is usually called syntax. 

§5.4 Definition 

11 • We have just seen why we need sentences, that is, syntax, to represent a problem, 
and why separate words are not enough. We have also seen two types of word: semantic 
words with some meaning, and syntactical words without any meaning. Well, although 
it seems impossible, there is a third type of word: defined words. 

12 • A defined word is just an abbreviation, so we could go on without them, but they 
are handy. That way we substitute a word for a whole expression. We can, for example, 
define a word to refer to a problem: 



q := (x? [2x = x 2 ]). 
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§5.5 Ordered Pair 

1fl • The resolution of the problem x? [2x = x 2 } can go on this way: 



[2x 



[2x = x 2 } 




- 2x = x 2 - 


-2x] 


[0 = xx - 


-2x] 


[0 = (x - 


-2)x] 


= 0] V [x = 


0] 


= 2]V[x = 


0] 


= 2] V [x = 


0] 


{2}U{0}: 


= {2,0} 



[x-2 + 2 



So it has two solutions, two and zero. In this case the resolution was achieved by analogy 
transforming the problem until we found two subproblems with a known solution, xl [x = 
2] and x? [x — 0], which we could then resolve by routine. 

%2 ■ To represent each transformation the simplest expedient is the ordered pair, (s,t), 
where s and t represent two expressions: s before being transformed, and t once it has been 
transformed. Note that we can use a single ordered pair to express each transformation, 
or the whole sequence. For example, the complete sequence can be summarized in just 
one ordered pair, which, in this case, describes how to resolve the problem by routine, 
because s is the problem and t its solution: 



(x?[2x = x\ {0,2}) . 



§5.6 Function 

11 • We can also resolve by trial and error. In the trial we have to test if a value satisfies 
the condition, or not, and the condition is an open expression. To note mathematically 
open expressions, also known as functions, we will use lambda expressions, \ x £,, where x 
is the free variable, and £ the open expression. In the case of our problem: 

X x [2x = x 2 ]. 

%2 ■ Now, to test a particular value a, we have to bind that value a to the free variable 
inside the condition. We also say that we apply value a to the function A^. In any case 
we write the binding this way: \ x £,(a). We can abbreviate the expression naming the 
function, for example / := \ x !;, to get the typical f(a). In our case, 'to test if number 
2 is equal when it is doubled to when it is squared' is written \ x [2x = a; 2 ] (2). And to 
calculate if a value satisfies the condition, we replace the free variable with the binding 
value; this process is called /3-reduction. In our case we replace x with 2 (x := 2), this 
way: 

X x [2x = x 2 ](2) ->■ [2.2 = 2 2 ] ^[4 = 4]^ true. 

13 • A condition can take only two values, which we will call true and false. In case 
we want to make a difference depending on the condition, and what else could we want?, 
then we have to follow one path when the condition is satisfied (true), and a distinct 
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path when the condition is not satisfied (false). To achieve this, a conditional command 
if is used: 

if (condition) then (true case) else (false case) . 

14 • We can write down completely any trial just by using this two expedients: binding 
values to free variables in open expressions, and a conditional command, as if. Suppose, 
for example, that we guess that the solution to our problem is one of the first four 
numbers, in other words, that the solution is in set {1,2,3,4}, and that we want to try 
them in increasing order. Then we should nest four if commands: 

if \ x [2x = x 2 ](l) then 1 
else if \ x [2x = x 2 ](2) then 2 

else if X x [2x = x 2 ](3) then 3 

else if X x [2x = x 2 ](4) then 4 
else false . 

§5.7 Condition 

%i ■ A known condition is a function with two possible outcomes, the loved true and the 
hated false. This means that each time we apply it we get a true or a false. For 
example, defining / := X x [2x = x 2 }, we get: 

/(0) ->■ TRUE 
/(l) ->■ FALSE 
f{2) ->■ TRUE 
/(3) ->■ FALSE 
/(4) ->■ FALSE 

12 • What resolves definitively a problem is the inverse function of the condition of the 
problem. Because the inverse function just reverses the condition, from a 0 — > true to 
a true — > 0, so when we apply true to the inverse condition we get the solutions, and 
when we apply false we get the set of the no-solutions. Thus we get: 

7(TRUE)^{0,2} 
/(FALSE) -> {1,3,4,...}. 

13 • We can use the condition in both directions: the natural direction, when we apply 
a value to test if it satisfies the condition, and the opposite direction, when we want to 
know what values satisfy the condition. To express a problem is enough to write the 
condition and to indicate which are its free variables, and to resolve a problem is enough 
to apply the condition in the opposite direction. 

14 • It is too easy to say that the resolution of a problem is the inverse function of its 
condition; I have just done it. Unfortunately, it is nearly always impossible to calculate 
that inverse function, and in some cases the condition is unknown. So, finding a resolution 
to a given problem is a problem, its metaproblem. 
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15 • Finding a resolution to a problem is a problem because it has its two mandatory 
ingredients. There is freedom, because there are several ways to resolve a problem, and 
there is a condition, because not every resolution is valid, but only those that provide 
the solutions to the problem. And then, being a problem, we need a resolution to find 
a resolution to the problem. And, of course, to find a resolution to find a resolution to 
the problem we need, can you guess it?, another resolution. What do you think? Better 
than 'convoluted' say 'recursive'. 

§5.8 Tree 

11 • If we know the solution of a problem, then we can apply a routine, and write it as 
an ordered pair, as we have already done, (x? [2x = x 2 }, {0,2} ). But we can also write 
it as a function, using the command if: 

A,,- if 7r = (xl [2x = x 2 ]) then {0, 2} else false 

Here 7r is a free variable that we can bind to any expression, which is then compared (=), 
not against the problem, but against the expression of the problem that we know how to 
resolve (xl [2x = x 2 ]), and, if the comparison is successful, then we get its two solutions 
({0,2}). When comparing open expressions, remember a-conversion and /3-conversion. 

12 ■ I will not add anything new about trials, because we already know that we can express 
any trial just nesting command conditionals of bound functions. But note that frequently 
determining the best way to order the tests is a problem. 

13 • By analogy we transform a problem into another problem, or problems. Most times 
the outcome will be more than one problem, because 'divide and conquer' is usually a 
good strategy for complex problems. So, in general, the resolution of a problem will be 
a tree, being the original problem its trunk. If we use analogy to resolve it and we get, 
for example, four easier subproblems, then the tree has four main branches. But, again, 
from each branch we can use analogy, and then we get some sub-branches, or we can use 
routine or trial. We resolve by routine when we know the solution, so the subproblem is 
solved; these are the leaves of the resolution tree. Trial . . . 

14 • You got lost? Don't worry, even myself get lost in this recursive tree, and to what 
follows the only important thing to keep in mind is one easy and true conclusion: expres- 
sions that represent resolutions of problems have a tree structure, because they describe 
the resolution tree. You also have to know that it is enough to allow that the elements 
of a ordered pair be ordered pairs to be able to build binary trees with ordered pairs. 

§5.9 Recursivity 

11 • Resolution is a process that, when it is successful, transforms a problem into its 
solutions. 

i , Resolution c , , . 
Problem > Solution 

But this neat and tidy view hides a more complex resolution tree, one that is full of 
subproblems. Inside the tree we see partial resolutions, which are atomic transformations 
transforming a subproblem into a subproblem. 

12 • The solution to a problem can also be a problem or a resolution. For example, when 
a teacher is looking for a question to include in an exam, her solution is a problem. And 
when an engineer is designing an algorithm to implement some general type of electrical 
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circuits, her solution is a resolution. Note that, in this last case, a previous step could 
be a sub-optimal algorithm to be enhanced, and then one of the transformations would 
take a resolution to return a resolution; this also happens with solutions. 

13 ■ Any open condition is a problem, if we are interested in the values that satisfy the 
condition. An open condition is an open expression, and closed expressions are just open 
expressions with no free variables, that is, with zero free variables. In addition, you can 
close any open expression by binding all its free variables. So, with this in mind, we can 
treat open expressions as expressions. Not every open expression is an open condition, 
but only generalizing to full functions we can cope with the resolution to resolution 
transformation that the engineer above needed. 

14 ■ The conclusion is then that a partial resolution can start with any expression and 
can end with any expression. So a resolution has to be able to transform any expression 
into any expression. 



15 • Please note that the resolution can be the transformation, but also what is to be 
transformed, and what has been transformed into. We call this feature of transformations 
that can act on or result in transformations without any restriction recursivity. 

§5.10 Quoting 

11 • Recursivity needs a quoting expedient to indicate if an expression is referring to a 
transformation, or it is just an expression to be transformed. 

12 • Joke: 

— Tell me your name. 

— Your name. 

Who is answering has understood 'tell me "your name" ', which could be the case, al- 
though unlikely. In any case, quotation explains the confusion. 

13 • Quoting is very powerful. Suppose that I write: 

En cuanto llegue, el ingles me dijo: "J was waiting you to tell you something. . . . 



. . . And even if you don't believe me, it is true". Y se fue, sin dejarme decide ni 
una sola palabra. 

Technically, I would have written a book in Spanish! 

14 • Technically, using the quoting mechanism, all symbolic languages are one. As we 
learned from Chomsky (2000), page 7, there is only one language. And, as in the Spanish 
book, even if you don't believe me, it is true. 




[a three hundred pages story in English] 
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§5.11 Summary 

%l ■ The requirements found in this Section §5 are evolutionary, so they are not pre- 
conditions of design, but post-conditions. Each evolutionary requirement appears by 
chance, grows in the short term by a mixture of chance and fitness, is kept in the mid 
term because of its fitness, and is entrenched in the long term as part of a bigger structure, 
which is the recursive engine, in our case. 

%2 ■ From a mathematical point of view, the list of requirements found in this Section §5 
is redundant. For example, lambda-calculus does not need definitions, because it does 
everything with anonymous functions. But, for a function to refer to itself, we need 
names, and therefore to define recursive functions we need definitions! It is nearly the 
same with conditionals. Lambda-calculus does not need them because they can be defined 
as any other function, but the conditional is the only instruction that a universal Turing 
machine needs. That the list of requirements is redundant only means that evolution does 
not value succinctness, but efficiency And though the different versions of recursion are 
mathematically equivalent, its implementations are different from an engineering point 
of view. Then, by implementing more than one, the resolver has the opportunity to use 
each time the version that is best suited to the problem. 

%3 ■ From a very abstract point of view, all the requirements found in this Section §5 can 
be connected to the concept of function, which is the key concept of recursion. A function 
is the general form of conditional, because the result of the function is conditioned on 
the values it takes. A computable function calculated by a universal Turing machine is 
expressed just as a list of conditionals. But a function can also be expressed as a structure 
with holes that, depending of how the holes are filled, results differently. The holes are 
pronouns, and the rest of the structure are other words that compose the sentence. The 
ordered pair is arguably the simplest structure, and then a tree built from ordered pairs 
is surely the simplest open structure, and therefore the binary tree is the first candidate 
for the sentence. See that the Merge operation of the minimalist program builds binary 
trees. And, finally, we need recursion to get the most general functions that are still 
effectively calculable. 

%4 ■ The point of this Section §5 was to show some requirements from problem solving that 
our natural languages exhibit. This does not prove that they were originated by problem 
solving, and below in Subsection §6.3 we will argue that some were possibly not, but, 
in any case, they enhanced our problem solving abilities, and then these 'problematic' 
features of language enhanced our survival fitness because of problem solving. 

§6 Implications 

§6.1 Problems and Evolution 

1fi • We are Turing complete, but not all living individuals are Turing complete, so we 
might wonder what difference does this make. To start, we have to distinguish solving, 
or finding solutions, from resolving, or finding resolutions, where a resolution is a way of 
solving, that is, a process for solving problems, so mathematically, see Casares (2015), 
a resolution is a function that takes a problem and returns solutions, right and wrong 
solutions. So, being a function, a Turing complete individual can express and understand, 
that is, imagine, any resolution in her recursive language, while a more limited individual 
will apply its limited set of resolutions to any problem. The key point is that a single 
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Turing complete individual can imagine any possible resolution, that is, any possible way 
of solving a problem, and then she can try any of the imagined resolutions that returns 
right solutions, while an individual that is not Turing complete can only execute those 
resolutions that are implemented in the hardware of its body, mainly in the hardware of 
its brain. 

12 • Species that are not Turing complete need a genetic change to modify its set of reso- 
lutions, while Turing complete individuals can try new resolutions without any hardware 
change, but just by a software change. So the timing of creativity depends on evolution 
until Turing completeness is achieved, and it does not depend on evolution after that 
point for the species that achieve Turing completeness. We will call every point where 
an evolutionary path achieves Turing completeness an evolutionary singularity. 

13 • New behaviors related to the resolution of problems, such as feeding, mating, and, in 
general, surviving, should proliferate after an evolutionary singularity. And noting that 
a tool is the physical realization of a resolution, then an explosion of new tools should 
start whenever a evolutionary singularity happens. So the archaeological record of human 
tools should point to our evolutionary singularity. In summary, creativity is slow until 
an evolutionary singularity, and creativity explodes after every evolutionary singularity. 

14 • Nevertheless, Turing completeness is defined by a single condition: pan-computability. 
A universal Turing machine, which is the prototype of Turing completeness, is every 
Turing machine that can compute whatever any Turing machine can compute, but no 
more. This means that to perform any specific computation you can, either build the 
Turing machine specific for that computation, or write the program for that specific 
computation on a universal Turing machine. Either way the outcome of the computation 
will be the same, and the only differences would be that the first can run faster, and that 
the second can be implemented faster, once you have a universal computer. The second 
is better for modeling, that is, for imagining, because writing models is much faster than 
building models, but, again, the results of the computations are the same. 

15 • In other words, creativity is the only exclusive behavior of Turing complete problem 
resolvers. And this explains an elusive fact: every time a specific behavior is presented as 
uniquely human, it is later rejected when it is found in another species. The point is not 
that we behave in some specific way to solve a problem, but that we are free to imagine 
any way to solve our problems. Creativity is the mark of Turing complete resolvers. 

§6.2 Syntax and Problems 

11 • We could equate problem solving with syntax, because syntax is needed just to express 
problems, but mainly because both syntax and problem solving are computing. 

Syntax = Computing = Problem Solving 

And full problem solving requires universal computability, which is the maximum syntac- 
tic requirement. So full problem solving needs syntax, and needs all of it: full problem 
solving needs the complete syntax. 

12 • By equating syntax to problem solving we have solved the syntax purpose paradox: 
when a brain becomes a complete syntax engine, then that brain becomes a full resolution 
machine. So syntax is meaningless but very useful and its purpose is to solve problems 
in any possible way. 
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13 • Syntax and problem solving use the same computing machinery, but their particular 
requirements are different: better syntax is for better communication, while better prob- 
lem solving is for better thinking. Then our computing machinery should have evolved by 
satisfying requirements from syntax (communication), or from problem solving (think- 
ing), or from both. And then we should find problem solving requirements in syntax, 
and syntax requirements in problem solving, or, in fewer words, language and thinking 
co-evolved in humans. 

14 ■ The description by Vygotsky (1934) of the co-development of language and thinking 
in children can be seen as a possible model for the co-evolution of language and thinking 
under our hypothesis. In fact, at this very abstract level, the development of syntax and 
problem solving in children can use every argument used here to base our hypothesis, 
and then, the very same arguments can be used to base a parallel hypothesis: 'syntax 
and problem solving co-develop in children towards recursion'. This parallel hypothesis 
is easier to examine, and if it is found true, then the original hypothesis, which is based 
on the same arguments though applied to a different process, would gain some support. 

15 • Syntax recursivity is credited with making possible the infinite use of finite means, 
which is obviously true. But our wanderings in the problem resolution realm have shown 
us that there are other features provided by recursive syntax that are, at least, as im- 
portant as infinity; infinity that, in any case, we cannot reach. My own three favorites 
are: 1) sentences, or hierarchical tree structures; 2) functions, or open expressions with 
free variables; and 3) conditionals, or expressing possibilities. / cannot imagine how we 
would see the world if there were no conditionals, can you? 

§6.3 Syntax and Evolution 

11 • We have seen that syntax and problem solving co-evolved towards Turing complete- 
ness, because both are computing, and Turing completeness is the maximum computing 
power. But, how was that co-evolution?, that is, which one, syntax or problem solving, 
was driving each step of the co-evolution? 

12 • I would argue that one of the very first steps, the sentence, was driven by syntax. 
The reason is that ambiguity is reduced efficiently just by imposing some order to a set 
of words, and this is the case even without free variables, which was possibly the first 
reason why problem resolving needed sentences. For example, the first step of syntax 
could be very simple: just put the agent before every other word. This simple expedient 
by itself would prevent efficiently some ambiguities, explaining who did and who was 
done something. 

13 • I would argue that the last step, fulfilling universal computability, was driven by 
problem resolving. The reason, now, is that Turing completeness is useful for problem 
resolving to detach itself from hardware causing an explosion of creativity, but it is not 
so useful to natural language syntax. 

14 • I don't know which one, syntax or problem solving, was driving each of the other steps 
that provided each of the capabilities that are needed to implement a Turing complete 
brain. But what I take for sure is that we humans evolved to Turing completeness, not to 
achieve a full resolution machinery, which was a final prize, but because recursive features, 
such as sentences, open expressions, and conditionals, made thinking (problem solving) 
or communication (syntax), or both, more efficient, resulting in more fitted individuals. 
A more detailed study of the requirements found in Section §5, in addition to those found 
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in a grammar text book, will be needed to disentangle the co-evolution of syntax and 
problem solving. 

15 ■ In any case, it would be difficult to deny that syntax was, at least, instrumental in 
achieving Turing completeness, and therefore, that syntax was influential in reaching our 
evolutionary singularity. 

§6.4 Beyond the Singularity 

11 ■ These results should help us to reevaluate syntax. In language evolution there are 
two main positions regarding syntax; see Kenneally (2007), Chapter 15. The gradualist 
side defends a more gradual evolution, where syntax is just a little step forward that 
prevents some ambiguities and makes a more fluid communication; see Bickerton (2009), 
page 234. For the other side, led by Chomsky, syntax is a hiatus that separates our own 
species from the rest; see Hauser, Chomsky, and Fitch (2002). What position should we 
take? 

12 ■ The co-evolution of syntax and problem solving explains that, once our ancestors 
achieved Turing completeness, they acquired complete syntax, and their brains became 
full resolution machines. Then they could see a different the world. How much different? 
Very much. 

13 • Seeing the world as a problem to solve implies that we look for the causes that are 
hidden behind the apparent phenomena. But it is more than that. Being able to calculate 
resolutions inside the brain is thinking about plans, goals, purposes, intentions, doubts, 
possibilities. This is more than foreseeing the future, it is building the future to our own 
will. Talking about building, what about tools? A tool is the physical realization of a 
resolution, so, with full syntax we can design tools, and even design tools to design tools. 

14 ■ Thinking the world as a problem to solve implies also that we calculate how to use 
freedom in order to achieve our objectives. So, yes, we are free because of syntax. 

15 • Summarizing, the identification of syntax with problem solving explains why syntax, 
being so little thing, has made us so different. And, although syntax is more than just one 
operation, so we have probably acquired the whole syntax in more than one evolutionary 
step, my vote goes, anyway, to . . . Chomsky! 

§6.5 Conclusion 

11 • Our species is Turing complete. Therefore we must explain the evolution of Turing 
completeness. 

12 • Turing complete individuals can transform strings of symbols, irrespective of the 
symbols meanings, but according to any possible finite set of well-defined rules. It seems 
a nice capability but, being meaningless, not very useful. It could be, but syntax is 
also about meaningless transformations of strings of symbols according to rules. Turing 
completeness is then a pan-syntactical capability, because the syntax of a natural language 
does not need to follow any possible set of rules, but only one specific set of rules. 

13 • Syntax is very peculiar indeed, because syntax is a peculiarity of human language, 
which is our most peculiar faculty. For this reason, we can argue that syntax is what 
defines our species, and yet it seems improbable to explain how some mechanical trans- 
formations of strings of symbols have made us like we are. In addition, syntax is not our 
only peculiarity: is it a coincidence that we are the only species with a syntactic language 
and also the most creative? 

14 • But then, after realizing that Turing completeness is closely related to syntax, which 
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is a human peculiarity, we have to halt, because we cannot progress anymore without 
new inputs. In order to overcome that impasse, this paper introduces a new piece of the 
jigsaw puzzle that links areas that were previously unconnected: the new piece is problem 
solving. Problem solving is a piece of problem that fits with syntax on one side, and with 
Turing completeness on the other side. 

15 • We are now ready to understand Turing completeness from the perspective of prob- 
lem solving. After finding that creativity is the peculiarity of Turing complete problem 
resolvers, we see how our Turing complete problem solving machinery explains both our 
creativity and our pan-syntactical capability. But before that, we have also seen that Tur- 
ing complete problem solving needs sentences, functions, and conditionals; all of them 
employed by syntax. 

16 • The conclusion is that syntax and problem solving should have co-evolved in humans 
towards Turing completeness. 
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